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ABSTRACT 

This paper studies decision problems for semigroups that are 
word-hyperbolic in the sense of Duncan & Gilman. A fundamen- 
tal investigation reveals that the natural definition of a 'word- 
hyperbolic structure' has to be strengthened slightly in order to 
define a unique semigroup up to isomorphism. It is proved that 
every word-hyperbolic semigroup has word problem solvable in 
polynomial time (improving on the previous exponential-time 
algorithm). Algorithms are presented for deciding whether a 
word-hyperbolic semigroup is a monoid, a group, a completely 
simple semigroup, a Clifford semigroup, or a free semigroup. 



1 INTRODUCTION 

The concept of word-hyperbolicity in groups, which has grown 
into one of the most fruitful areas of group theory since the publication of 
Gromov's seminal paper [Gro87], admits a natural extension to monoids via 
using Gilman's characterization of word-hyperbolic groups using context-free 
languages [G1I02], which generalizes directly to semigroups and monoids 
[DG04]. Informally, a word-hyperbolic structure for a semigroup consists of a 
regular language of representatives (not necessarily unique) for the elements 
of the semigroup, and a context-free language describing the multiplication 
table of the semigroup in terms of those representatives. 

This generalizaion has led to a substantial amount of research on word-hy- 
perbolic semigroups; see, for example, [CM12, FK04, HKOT02, HT03]. Some 
of this work has shown that word-hyperbolic semigroups do not possess such 
pleasant properties as word-hyperbolic groups: they may not be finitely pre- 
sented; they are not in general automatic or even asynchronously automatic 
[HKOT02, Example 7.7 et seq.]. 
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The computational aspect of word-hyperbolic semigroups has so far re- 
ceived limited attention. The only established result seems to be the solvablity 
of the word problem [HKOT02, Theorem 3.8]. In contrast, automatic semi- 
groups, which generalize automatic groups [ECH + 92] and whose study was 
inaugurated by Campbell et al. [CRRT01], have been studied from a computa- 
tional perspective, with both decidability and undecidability results emerging 
[Caio6, KO06, Ott07]. 

This paper is devoted to some important decision problems for word-hy- 
perbolic semigroups. Word-hyperbolic structures are not necessarily 'stronger' 
or 'weaker' computationally than automatic structures. As noted above, word- 
hyperbolicity does not imply automaticity for semigroups, so one cannot ap- 
peal to known results for automatic semigroups. A word-hyperbolic struc- 
ture encodes the whole multiplication table for the semigroup, not just right- 
multiplication by generators (as is the case for automatic structures). On the 
other hand, context-free languages are less computationally pleasant than reg- 
ular languages. For instance, an intersection of two context-free languages is 
not in general context-free, and indeed the emptiness of such an intersection 
cannot be decided algorithmically. Thus, in constructing algorithms for word- 
hyperbolic semigroups, it is often necessary to proceed via an indirect route, 
or use some unusual 'trick'. 

The most important result is a polynomial-time algorithm for solving the 
word problem (Section 4). As remarked above, the word problem was already 
known to be solvable, but the previously-known algorithm required time ex- 
ponential in the lengths of the input words [HKOT02, Theorem 3.8]. 

Some basic properties are then shown to be decidable (Section 5): being 
a monoid, Green's relations L, 3?, and 'K, being a group, and commutativity. 
These results are not particularly difficult, but are worth noting. 

The main body of the paper shows the decidability of more complicated 
algebraic properties: being completely simple (Section 6), being a Clifford 
semigroup (Section 7), and being a free semigroup (Section 8). 

Before embarking on the discussion of decision problems, it is necessary 
to make a fundamental study of the notion of word-hyperbolicity, because the 
natural notion of a word-hyperbolic structure, or more precisely an 'interpre- 
tation' of a word-hyperbolic structure, does not determine a unique semigroup 
up to isomorphism. A slightly strengthed definition is needed, and this is the 
purpose of the preliminary Section 2. 

The paper ends with a list of some open problem (Section 9). 



2 THE LIMITS OF INTERPRETATION 

Before developing any algorithms for word-hyperbolic semigroups, 
we must clarify the relationship between a word-hyperbolic structure (that is, 
an abstract collection of certain languages) and a semigroup it describes. A 
similar study grounds the study of decision problems for automatic semi- 
groups by Kambites & Otto [KO06], and our strategy and choice of terminol- 
ogy closely follows theirs. 

Let A be an alphabet. For any word w € A*, the reverse of w is denoted 
w rev YVe extend this notation to languages: for any language L C A*, let 
L rev = {w rev : w € L}. 

Definition 2.1. A pre-word-hyperbolic structure L consists of: 
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• a finite alphabet A(L); 

• a regular language L(£) over A(£); 

• a context-free language M(Z) over A(Z) U{#i , #2}, where #1 and #2 are new 
symbols not in A(I), such that M(I) C L(I)# T L(I)# 2 L(I) rev . 

When L is clear from the context, we may write A, L, and M instead of A(Z), 
L(Z), and M(L), respectively. 

'Pre-word-hyperbolic structures' consist only of languages fulfilling certain 
basic properties: there is no mention of being a structure 'for a semigroup' in 
the definition. Now, following Kambites & Otto for automatic semigroups 
[KO06, § 2.2], let us make a first attempt to turn the abstract pre-word-hy- 
perbolic structure into something that describes a semigroup. As well shall 
see, this definition is flawed, but it is instructive to see its consequences, since 
these illustrate why the improved Definition 2.7 is actually the correct one. 

Definition 2.2 ((First attempt)). An interpretation of a pre-word-hyperbolic 
structure L with respect to a semigroup S is a map cf) : A + — > S such that 
Lc() = S and 



When there is no risk of confusion, denote u<j) by u for any u € A + , and Xcf> 
by X for any X C A+. 

If a pre-word-hyperbolic structure L admits an interpretation with respect 
to a semigroup S, then I is a word-hyperbolic structure for S. 

Suppose L is a word-hyperbolic structure for S, as per Definition 2.2. Then 
words in A + represent elements of S, the regular language L contains at 
least one representatives for every element of S, and the context-free lan- 
guage M encodes the multiplication table for S in terms of representatives 
in L. However, there is a problem. In contrast to the situation for automatic 
semigroups [KO06, Proposition 2.3], a word-hyperbolic structure (using Defi- 
nition 2.2) does not uniquely determine a semigroup: the same pre-word-hy- 
perbolic structure can admit interpretations with respect to non-isomorphic 
semigroups, as the following example shows: 

Example 2.3. Let L be a pre-word-hyperbolic structure with A = {a, b,c}, 
L = A, and M = {u#i v#2 a rev : u, v G L}. (Of course, a rev = a since a is a single 
letter.) 

Let S be the two-element null semigroup {0,x}, where all products are 
equal to 0. Let T be the three-element null semigroup {0,x,y}, again with all 
products equal to 0. 

Define mappings (J) : A — > S and : A — > T by 



1VI(I) ={U#!V# 2 W 5 



,rev 



: u, v, w € L, (u(j))(v4>) = wc})}. 



ac() = 0, 
cup = 0, 



b 45 = x, 
b4> = x, 



ccb = x, 
a|> =y. 



Then LcJj = S and Li[> = T. Furthermore, 



M = {u#i v#2 a r 
= {u#i v#2 a r 
= {u#i V#2 w 1 



rev 



: u,v € L} 

: u,v € L, (u4>)(vcj)) = acf)} 
: u,v, w G L, (u4>)(v4>) = wc})}, 



rev 



.rev 
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since all products in S are equal to and a is the unique word in L mapped to 
by c}). Similarly, 

M = {u#iv#2W rev : u, v, w € L, (ut|))(v4>) = wip} 

since all products in T are equal to and a is the unique word in L in mapped 
to by 4>. 

Thus <J) and 4> are interpretations of L with respect to the non-isomorphic 
semigroups S and T respectively. 

Hence, it seems not to make sense to consider decision problems for gen- 
eral word-hyperbolic semigroups, at least with the current definitions. It 
would be illogical to ask for an algorithm that takes as input a word-hyper- 
bolic structure and determined some property of 'the' semigroup is describes, 
since there is no such unique semigroup. The fundamental problem Exam- 
ple 2.3 elucidates is that the word-hyperbolic structure 1 does not determine 
whether the two symbols b and c represent the same element or different el- 
ements. However, this problem only arises for two symbols in A; that is, for 
two words over A of length 1, in the sense that if we insist that an interpre- 
tation should consist of a map that sends distinct letters to distinct elements 
of the semigroup (that is, the map is an injection when restricted to A), then 
a word-hyperbolic structure does describe a unique semigroup up to isomor- 
phism. In order to prove this result (Proposition 2.5), we need the following 
lemma: 

Lemma 2.4. Let L be a word-hyperbolic structure. Then there is a relation E(X) C 
L(Z) x L(I), dependent only on I, such that the following are equivalent for any 
words w,x G L(I): 

1. (w,x) G E(I); 

2. wcj) = xc()/or some interpretation <£> : A(Z) + — > S with <J)|a being an injection; 

3. w<$ = x§> for any interpretation <£> : A(I) + — s> S with 4>|a being an injection. 

Proof of 2.4. Define 

E' = {(w,x) : w G L(I),x G L(I), |w| ^ |x|, |w| > 2, 

(3u,v G L(I))(u# 1 v# 2 w rev G M(I) Au#!V# 2 x rev G M(I)} 

and let 

E(I) = {(a, a) : a G A(I) n L(Z)} UE'u (E'p 1 • (2.1) 

The aim is now to show that E(Z) has the required properties. Let w, x G L(Z). 

First suppose that (1) holds; that is, that (w,x) G E(Z). Let 4> be any 
interpretation of 1. Either w, x G A(Z) n L(L), in which case w = x by (2.1) 
and so wcf) = x<J), or (w,x) £ E'u (E') _1 . Assume (w,x) G E'; the other case 
is symmetrical. Then there exist u, v G L(Z) such that u#i v#2W rev G M(L) and 
u#iv#2X rev G M(Z). Hence wcf) = (uc|))(vct)) = xc}) since cf) is an interpretation 
of I. Hence (1) implies (3). 

It is clear that (3) implies (2). Now suppose that (2) holds; that is, that 
wc)) = xcf) for some interpretation cf) : A(l) + — > S with <J)|a being an injection. 
If |w| = |x| = 1, then w,x G A and so w = x since cJ)|a is injective, and so 
(w,x) G E(I). Now suppose that at least one of |w| and |x| is greater than 1. 
Assume |w| ^ |x|; the other case is similar. Since w has at least two letters, 
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the element w$ is decomposable in S. So there are words u,v e L with 
(u4>)(v(J)) = wcf). Since wcf) = xfy, it also follows that (u4))(vct)) = xcf). Thus the 
words u#i v#2W rev and u#i v#i x rev both lie in M since 4> is an interpretation of 
I. Hence (w, x) eE'C E(I). Hence (2) implies (1). [24] 

Proposition 2.5. Lef I be a word-hyperbolic structure admitting interpretations 
4> : A + — > S and i|> : A + — > T, wz'f/2 bo£/i c})|a flwrf 4>Ia fcemg injections. Then there 
is an isomorphism xfrom S to T swc/z that (JjIlt = \p|i_. 

Proof of 2.5. Define maps t : S — > T and t' : T — > S as follows. For any s € S 
let st be wij;, where w € L is some word with wcj) = s, and for any t 6 T, let 
tr' be w'<J>, where w' € L is some word with w'i|> = t. (The words w and w' 
are guaranteed to exist since cf) and i|> are surjections.) The maps t and r' are 
well-defined as a consequence of Lemma 2.4. 

To show that t is a homomorphism, proceed as follows. Let r, s € S and 
choose u,v, w G L with ucf) = r, vfy = s, and wtj) = rs. Then tt = m^, st = vij), 
and (ts)t = wi|>, by the definition of t. Now, u#iv#2W rev G M (since cf) is an 
interpretation of L) and so (m[>)(v4>) = wi|> (since 4> is an interpretation of Z). 
Thus 

(tt)(st) = (m[>)(vi[>) = (wip) = (ts)t 

and so t is a homomorphism. 

Symmetric reasoning shows that x' : T — > S is a homomorphism. The maps 
t and x' are mutually inverse, since if w € L is such that w<J) = s and wip = t, 
then st = t and t' = s. Thus t : S — > T is an isomorphism. By the definition 
of t using elements of L, it follows that <b\i_T = 4>|l- [2I5] 

The extra condition used in Proposition 2.5, where the interpretation re- 
stricted to the alphabet A is an injection, does not restrict the class of word- 
hyperbolic semigroups: 

Proposition 2.6. Let L be a word-hyperbolic structure and : A(I) + — >■ S an 
interpretation for L with respect to a semigroup S. Then there is a word-hyperbolic 
structure TT, effectively computable from L and 4>L\(£)/ "with A(TT) C A(L), admitting 
an interpretation i[> : A(TT) + — > S wzfTz 4>lA(n) being an injection. 

Proof of 2.6. Initially, let TT = L. We will modify TT until it has the desired 
property. 

Suppose 4>Ia(tt) is n °t injective. Pick a, b e A(TT) with acj) = bcf). Replace 
every instance of b by a in words in L(TT). (This corresponds to replacing b 
by a whenever it appears as a label on an edge in a finite automaton recog- 
nizing L(TT).) Replace every instance of b by a in words in M(TT). (This corre- 
sponds to replacing b by a whenever it appears as non-terminal in a context- 
free grammar defining L(TT).) Finally, delete b from A(TT). Since acj) = bcj), 
it follows that TT is a word-hyperbolic structure admitting an interpretation 
4 ) lA(n)+ : A(TT) + - > S with respect to S. 

Since A(TT) is finite, we can iterate this process until c|)|a(tt] becomes injec- 
tive. Finally, define i\> = $\a(t\)+- HU 

In light of Proposition 2.6, we modify our definition of interpretation, to 
insist that each symbol represent a different element of the semigroup: 
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Definition 2.7 ((Improved version)). An interpretation of a pre-word-hyper- 
bolic structure 1 with respect to a semigroup S is a map cf) : A(X) + — >• S, with 
4>|a being injective, such that (L(Z))4> = S and 

M(I) = {u# 1 v# 2 w BW : u,v, w G L(I), (ucj>)(vc|)) = w<f>}. 

Again, when there is no risk of confusion, denote ucj) by u for any u G A + , 
and Xcf) by X for any X C A+. 

Therefore, a word-hyperbolic structure is henceforth a pre-word-hyperbol- 
ic structure that admits an interpretation in the sense of Definition 2.7. With 
this new definition, Proposition 2.5 shows that each word-hyperbolic structure 
describes a uniquely determined semigroup, and therefore one can sensibly 
attempt to solve questions about the semigroup using the word-hyperbolic 
structure. 

However, although a word-hyperbolic structure determines a unique semi- 
group, it does not determine a unique interpretation, even up to automorphic 
permutation. This parallels the situation for automatic semigroups [KO06, 
§ 2.2], but is also true in a rather vacuous sense for word-hyperbolic semi- 
groups, for the alphabet A(I) for a word-hyperbolic structure I for a semi- 
group S may include a symbol c that does not appear in any word in either 
L(L) or M(Z). (In this situation, c must represent a redundant generator for 
S.) 

For example, let A(I) = {a,b,c}, L(I) = {a,b}+, and M(I) ={u# lV #! (uv) rw 
u,v G {a, b} + . Then L is a word-hyperbolic structure for the free semigroup 
F with basis {x,y}: let <J) : A(I) + — > F be such that acf) = x and bcf) = y; 
regardless of how ccj) is defined, cf) is an interpretation of L with respect to F. 

Less trivial is the following example: 

Example 2.8. Let S = ({1,2} x {1,2,3}) U {Os, Is} and define multiplication on 
Sby 

(i m y) = (° s ifA = j = L 
1 (i, (J.) otherwise; 

1 sx = xl s = x for all x G S; 
s x = xO s = S for all x G S. 

Then S is a monoid. (In fact, S is a monoid formed by adjoining an identity to 
a O-Rees matrix semigroup over the trivial group.) 



Let A(I) 


= {a, 


b,c,d, e. 


, i, z}. Let L(I) = 


= {a, b, c, d, bed, deb, i, 


z}. Define 


+1 :A(Z) 


-> S 


acjJi = 


(1,1), 


bcf)! = 


(1,2), cch=(1,3), 








dch = 


(3,1), 


e4>i = 


(2,2), icj)! = l s , 


z4>i = S ; 


+ 2 :A(Z) 


-> S 


acj) 2 = 


(1,1), 


b(J) 2 = 


(1,2), c4) 2 = (l,3), 








dcb 2 = 


(3,1), 


e*2 = 


(1,3), ich = l s , 


Z$ 2 = S - 



Define 



M(I) ={u#!V# 2 w rev :u,v,w G L(I), [ufa )(v(j>i ) =w4>i>. 

Since L(Z) is finite, M(Z) is also finite and thus context-free. So L is a word- 
hyperbolic structure for S and 4>i is an interpretation for I with respect to S. 
However, 

(bed)*, = (1,2)(2,2)(2,3) = (1,3) = (1,2)(1,3)(2,3) = (bed)cf) 2 , 
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and similarly (deb)cj)i = (deb)4>2- Hence cj)i|i_ = <&2\l- Therefore 

M(I) = {u#!V# 2 w rev : u,v, w G L(I), (uteKvfo) = wcM, 

and so cJj^ is also an interpretation of L with respect to S. Moreover, there is 
no automorphism p of S such that cfj^ p = cb2- To see this, notice that such 
a p would have to map ec})i = (2,2) to e<$>2 = 0)3). The map p would also 
preserve 3?-classes. But the 3?-class of (1,3) contains the element (1,1), which 
is not idempotent (since (1 , 1 ) (1 , 1 ) =0), whereas every element of the 3?-class 
of (2, 2) is idempotent. So no such map p can exist. So the two interpretations 
are not even equivalent up to automorphic permutation of S. 

The crucial point in Example 2.8 is that Proposition 2.5 only guarantees 
that the restriction of two interpretations to L are equivalent up to automorphic 
permutation. It says nothing about the interpretation maps on the whole of 
A+. 

In order to compute with the semigroup described by a word-hyperbolic 
structure, interpretations must be coded in a finite way. 

Definition 2.9. An assignment of generators for a word-hyperbolic structure L 
is a map a : A(L) — > L(L) with the property that there is some interpretation 
<£> : A(I) + — > S such that aa<|) = acf) for all a € A; such an interpretation is 
said to be consistent with a. Two assignments of generators a and |3 for Z are 
equivalent if (aa, a|3) G E(I) for all a G A(I). 

Proposition 2.10. An assignment of generators for a word-hyperbolic structure is 
consistent with a unique interpretation (up to automorphic permutation of the semi- 
group described). Equivalent assignments of generators are consistent with the same 
interpretation. 

Conversely, every interpretation is consistent with a unique (up to equivalence) 
assignment of generators. 

Proof of 2.10. Let Z be a word-hyperbolic structure and a : A — > L an assign- 
ment of generators. Then there is an interpretation <£> : A + — > S of 1 that is 
consistent with a; that is, aac}) = acf) for all a € A. 

Let 4> : A + — >■ S be another interpretation of 1 that is consistent with a; the 
aim is to show that <{) and <}) differ only by an automorphic permutation of S. 
First, acu|> = a4> for all a G A, since i|> is consistent with a. By Proposition 2.5, 
there is an automorphism t of S such that <$>\\_r = t[>|l, and so acf)T = act^x = 
aoal> = m[> for all a G A. Hence <J) and ip differ only by the automorphism t. 

Now let (3 : A — » L be an assignment of generators equivalent to a; the 
aim is to show that (3 is also consistent with <£>. Now, (aa, a (3) G E(Z) for all 
a G A since a and (3 are equivalent. Thus acf) = aacj) = a(3cf) by Lemma 2.4, 
and hence |3 is also consistent with the interpretation c|). 

Finally, suppose y : A — > L is an assignment of generators consistent with 
4>; the aim is to show a and (3 are equivalent. Now, aacj) = acf) = aycf) for 
all a G A since a and y are consistent with <$>. Hence (aa, ay) G E(I) for all 
a G A by Lemma 2.4. [TTo] 

Definition 2.11. A word-hyperbolic structure L is said to be an interpreted 
word-hyperbolic structure if it is equipped with an assigment of generators 
<x(I). 
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Proposition 2.12. Let I be an interpreted word-hyperbolic structure for a semigroup 
S. Then there is another interpreted word-hyperbolic structure L' for S, effectively 
computable from I, such that A(Z') C L(L') and a(I') is the embedding map from 
A(I') foL(I'). 

Proof of 2.12. Let A(I') be A(I) and L(I') = L(I) U A(I). For brevity, let 
a = oc[L). For each a, b,c G A(Z'), define the languages: 



ML" 


= {a#i v#2W rev 


: (aa)# 1 v# 2 w rev G M(I)}, 




= {u#i a# 2 w rev 


:u#i(aa)# 2 w rev G M(I)}, 




= {u#iv# 2 a rev : 


u# 1 v# 2 (aa) rev G M(I)}, 




= {a#! b# 2 w rev 


: (oa)#i (ba)# 2 w rev G M(I)}, 




= {u#, a# 2 b rev : 


:u#i(aa)# 2 (ba) rev G M(I)}, 




= {a#!V# 2 b rev : 


(aa)# 1 v# 2 (ba) rev G M(I)}, 




= {a#! b# 2 c rev : 


(aa)#i (ba)# 2 (ca) rev G M(I)}. 



Each of these languages is context-free because each is the intersection of the 
context-free language M(L) with a regular language. [Notice that M-J b c is 
either empty or a singleton language.] 
Now let 

M(I') = M(I) U |J (M^UMf'uM^) 

Q6A(I) 

U U KIUM^) 

a,beA(I) 

u U <i, c ; 

a,b,ceA(I) 

notice that M(Z') is also context-free. 

Let cf) : A(L) S be an interpretation of Z. Then, recalling that A(Z) = 
A(I'), 

M(I') = {u#iv# 2 w rev : u,v, w G L(l'), (u<t>)(v<|>) = w*}, 

because u, v, and w range over L(Z') = L(Z) U A(I), and the eight cases that 
arise depending on whether each word lies in L(Z) or A(Z) correspond to the 

eight sets M(I), mL 2) , mL 3) , M^J,, M^J,, M^j,, and M^ >c . [Notice 

that these sets are not necessarily disjoint, since it is possible that act = a for 
some a G A.] 

Finally, define a(Z') to be the embedding map from A(Z') to L(Z'). This 
map is an assignment of generators since trivially ((a)a(Z'))4 ) = a <b f° r an y 
interpretation of I'. [T12I 

In light of Proposition 2.12, we will assume without further comment that 
an interpreted word-hyperbolic structure I has the property that A(Z) C L(l) 
and that ot[L) is the embedding map from A(L) to L(L). Notice further that 
the computational effectiveness aspect of Proposition 2.12 ensures we are free 
to assume that an interpreted word-hyperbolic structure serving as input to a 
decision problem has this property. 

For automatic semigroups, it is possible to assume that the automatic struc- 
ture has a further pleasant property, namely that every element of the semi- 
group is represented by a unique word in the language of representatives 



8 



[KO06, Proposition 2.9(iii)]. However, there exists a word-hyperbolic semi- 
group (indeed, a word-hyperbolic monoid) that does not admit a word-hy- 
perbolic structure where the language of representatives has this uniqueness 
property [CM12, Examples 10 & 11]. 



3 BASIC CALCULATIONS 

This section notes a few very basic facts about computing with 
word-hyperbolic structures for semigroups that are used later in the paper. 

Lemma 3.1 ([HKOT02, Proof of Lemma 3.6]). There is an algorithm that takes as 
input a word-hyperbolic structure I and two words p, q € L(I), and outputs a word 
r G L(L) satisfying pq" = f with |r| ^ c(|p| + |q|) (where c is is dependent only on 
L) in time 0((|p| + |q|) 5 ). 

[Actually, the appearance of this lemma in [HKOT02] allows p or q to be 
empty and asserts that |r| $C c(|p| + |q| + 2). To obtain the lemma above, where 
p and q are non-empty, increase c appropriately] Notice that there may be 
many possibilities for a word r with p q~ = f. 

Lemma 3.2. There is an algorithm that takes as input a word-hyperbolic structure 
L and three words p, q,r e L(Z), and decides whether p q" = f in time cubic in 
Ipl + lql + M. 

Proof of 3.2. The algorithm simply checks whether p#i q#2r rev € M(L), and the 
membership problem for context-free languages is soluble in cubic time. 



4 WORD PROBLEM 

This section is dedicated to proving that word-hyperbolic semi- 
groups have word problem soluble in polynomial time. 

As noted in the introduction, the previously-known algorithm required ex- 
ponential time [HKOT02, Theorem 3.8]. This motivated Hoffmann & Thomas 
to define a narrower notion of word-hyperbolicity for monoids that still gen- 
eralizes word-hyperbolicity for groups. By restricting to this version of word- 
hyperbolicity, one recovers automaticity [HT03, Theorem 3] and an algorithm 
that runs in time O(nlogn), where n is the length of the input words [HT03, 
Theorem 2]. Although the algorithm described below is not as fast as this, 
the existence of a polynomial-time solution to the word problem for word- 
hypebolic monoids (in the original Duncan-Gilman sense) diminishes the ap- 
peal of the Hoffmann-Thomas restricted version. 

Theorem 4.1. There is an algorithm that takes as input an interpreted word-hyper- 
bolic structure L for a semigroup and two words w,w' G A(I) + and determines 
whether w = w' in time polynomial in |w| + |w'|. 

Proof of 4.1. By interchanging w and W if necessary, assume that |w| ^ |w'|. 
First, if |w| = |w'| = 1, then w, w' G A(L) and so (since the interpretation map 
is injective on A(I)), we have w = w' if and only if w = w'. 

So assume |w| 2. Factorize w as w = w^'w^ 2 ', where w^ 1 ' = [|w|/2J. 
Notice that w = w' if and only if w' 1 ) w' 2 ' = w'. 
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figure l. Finding a word in R representing the same element as w e A*. 



By Lemma 4.2 below, there is a polynomial-time algorithm that takes the 
three words \ w' 2 ', and w' and yields words and u' in L(L) 

representing w^ 2 \ and w', of lengths at most (c + 1 )\w^ ' | 1+1 °g( c + 1 ) / 

(c + l)| W ( 2 »| 1+log(c+1) and (c + 1)|w / | 1+lo s( c+1 ' / in time polynomial in |w( 1l |, 
|w( 2 '|, and \w'\, respectively where c is a constant dependent only on L. 

It follows that w = w' if and only if u^ 1 ' u^ 2 ' = u', and, by Lemma 3.2, 
this can be checked in time cubic in lu' 1 '| + |u' 2 '| + |u'|, which, by the bounds 
on the lengths of , and u', is still polynomial in the lengths of w and 

w'. Thus the word problem for the semigroup described by L is soluble in 
polynomial time. [p] 

Lemma 4.2. There is an algorithm that takes as input an interpreted word-hyperbolic 
structure Lfor a semigroup and a words w e A(I) + and outputs a word u e L(Z) 
with w = u and |u| ^ |w|(c + 1 )|w| lo st c+1 \ and which takes time polynomial in |w|. 

Proof of 4.2. Suppose w = w\ • • • w n , where w; e A C L. Therefore Wi, . . . ,w n 
is a sequence of words in L whose concatenation represents the same element 
of the semigroup as w. 

For the purposes of this proof, the total length of a sequence Si , . . . , Sj of 
words in A* is defined to be the sum of the lengths of the words \s-\ | + . . . + \si\. 

Consider the following computation, which will form the iterative step of 
the algorithm: suppose there is a sequence of words si , . . . , si, each lying in 
L(I) and each of length at most t. Notice that It is an upper bound for the total 
length of this sequence. For i = 1 , . . . , \ l/2\, apply Lemma 3.1 to compute a 
word s{ G L(Z) representing S2i-i S2i of length at most c(|s2i-i I + Is2il) ^ 2ct. 
For each i = 1, . . . , [t/2], this takes 0((|s2i-i I + ls2il) 5 ) time, which is at worst 
0((2t) 5 ) time. Therefore the total time used is at most 0( [i/2\ (2t) 5 ), which is 
certainly no worse than time 0((€t) 5 ). That is, the total time used is at worst 
quintic in the upper bound of the total length of the original sequence. 

If I is odd, set to be s^. (If I is even, \l/2~\ = [i/2\, so sL^ 2 i has 

already been computed.) This is purely notational; no extra computation is 
done. 

The result of this computation is a sequence of \l/2~\ words, each of length 
at most 2ct, whose concatenation represents the same element of the semi- 
group as the concatenation of the original sequence. The total length of the 
result is at most (c + 1 )£t; that is, at most c + 1 times the total length of the 
previous sequence. 

Apply this computation iteratively starting with the sequence Wi , . . . ,w n 
and continuing until a sequence with only one element results. Since each 
iteration takes a sequence with I terms to one with \l/2~\ terms, there are at 
most [logn] iterations. The first iteration of this computation, applied to a 
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sequence whose total length is at most u, completes in time 0(n 5 ), yielding 
a sequence of total length at most n(c + 1 ); the next iteration completes in 
time 0((n(c + 1 )) 5 ), yielding a sequence of total length at most n(c + 1 ) 2 . In 
general the i-th iteration completes in time at most 0((n(c + 1 ) ) 5 ), yielding 
a sequence of total length at most n(c + 1 )\ So the [logn] iterations together 
complete in time at most 0((1 +logn)(n(c + l) 1+logn ) 5 ), since [logn] ^ 1 + 
log n. [Figure l illustrates the algorithm geometrically: each iteration yields a 
sequence of roughly half as many words in L(Z) labelling a sequence of arcs 
that each span a subword twice as long as the corresponding terms in the 
preceding sequence.] 

Applying exponent and logarithm laws, 

n(c + l) 1+1 °S n =n(c + l)(c + l) l0 8 n 
= n(c + l)n l0 8 (c+1) 
= (c + 1)u 1+1 °s (c+1 \ 

and so, since c is a constant, the algorithm completes in time 

0(n 5+51og(c + 1) logn ^ 

yielding a word in L(Z) of length at most n(c + 1 )n lo &( c+1 

Interestingly, although Theorem 4.1 gives a polynomial-time algorithm for 
the word problem for word-hyperbolic monoids, the proof does not give a 
bound on the exponent of the polynomial, because the constant c of Lemma 
3.1 is dependent on the word-hyperbolic structure L. There is thus an open 
question: does such a bound actually exist? or can the word problem for 
hyperbolic semigroups be arbitrarily hard within the class of polynomial-time 
problems? 

The algorithm described in Lemma 4.2 is not particularly novel. It is similar 
in outline to that described by Hoffmann & Thomas [HT03, Lemma 11] for 
their restricted notion of word-hyperbolicity in monoids. However, the proof 
that it takes time polynomial in the lengths of the input words is new. 

Hoffmann & Thomas describe their algorithm in recursive terms: to find 
a word in L(L) representing the same element as w e A*, factor w as w'w", 
where the lengths of w' and w" differ by at most 1, recursively compute 
representatives p ' and p" in L(Z) of w' and w", then compute a representative 
for W using p' and p". This last step they prove to take linear time (recall 
that this only applies for their restricted notion of word-hyperbolicity) and to 
yield a word of length at most |p'| + |p"| + 1, which shows that the whole 
algorithm takes time O(nlogn). However, this recursive, 'top-down' view of 
the algorithm obscures the fact that the overall strategy can be made to word 
even for monoids that are word-hyperbolic in the general Duncan-Gilman 
sense. It is through the iterative, 'bottom-up' view of the algorithm presented 
above that it becomes apparent that the length increase of Lemma 3.1 remains 
under control through the log n iterations. 



5 DECIDING BASIC PROPERTIES 

This section shows that certain basic properties are effectively 
decidable for word-hyperbolic semigroups. First, being a monoid is decidable: 
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Algorithm 5.1. 

Input: An interpreted word-hyperbolic structure L for a semigroup. 

Output: If the semigroup is a monoid (that is, contains a two-sided iden- 
tity), output Yes and a word in L(I) representing the identity; otherwise output 
No. 

Method: 

1. For each a € A, construct the context-free language 

I Q = {iG L: a#!t# 2 aG M} (5.1) 

and check that it is non-empty. If any of these checks fail, halt and output 
No. 

2. For each a € A, choose some t a G I a . 

3. Iterate the following step for each a G A. For each b G B, if i a b = b i Q = b, 
halt and output Yes and t a . 

4. Halt and output No. 

Proposition 5.2. Algorithm 5.1 outputs Yes and i if and only if the semigroup 
defined by I is a monoid with identity i. 

Proof of 5.2. Suppose first that Algorithm 5.1 halts with output Yes and i. Then 
by step 3, ib=bi = b for all b G A. Since A generates S, it follows that 
si = Is = s for all s G S and hence i is an identity for S. 

Suppose now that S is a monoid with identity e. Then there is some word 
w G L with w = e. For every a G A, ae = a, and so a#iw#2<i G M. Thus 
w G I a for all a G A and so each I a is non-empty. Thus the checks in step 1 
succeed and the algorithm proceeds to step 2. 

Suppose that w = W] • • • w n , where w, G A for each j = 1 , . . . , n. Then 

e = w = wi • • • w n _i w n 

= wi • • • w n _-| w n i Wn (by the choice of i Wn G I W J 
= ei Wn 

= i Wn (since e is an identity for S). 

Hence i Wn represents the identity e and so i Wn b = b i Wn = b. Thus at least 
one of the i Q chosen in step 2 passes the test of step 3 (which guarantees that it 
represents an identity since A generates S) and so the algorithm halts at step 3 
and outputs Yes and a word i a representing the identity. 

Question 5.3. Is there an algorithm that takes as input an interpreted word- 
hyperbolic structure and determines whether the semigroup it defines con- 
tains a zero? 

Notice that this cannot be decided using a procedure like Algorithm 5.1, 
or at least not obviously, because the natural analogue of I Q is 

Z Q = {z G L : a#! z# 2 z rev G MJ, 

which is naturally defined as the intersection of M and {u#iv#2V rev : u, v G 
A + }. However, testing the emptiness of an intersection of context-free lan- 
guages is in general undecidable. So using Z Q would, at minimum, require 
some additional insight into the kind of context-free languages that can appear 
as M. 
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Notice that commutativity is very easy to decide for a word-hyperbolic 
semigroup; one needs to check only that ab = ba for all symbols a, b G A(Z). 
This is simply a matter of performing a bounded number of multiplications 
and checks using Lemmata 3.1 and 3.2. 

Green's relation L is decidable for automatic semigroups; in contrast, Green's 
relation "R is undecidable, as a corollary of the fact that right-invertibility is un- 
decidable in automatic monoids [KO06, Theorem 5.1]. In contrast, and L 
are both decidable for word-hyperbolic semigroups, as a consequence of M(Z) 
describing the entire multiplication table. 

Proposition 5.4. There is an algorithm that takes as input an interpreted word-hy- 
perbolic structure I and two words w,w' G L(L) and decides whether the elements 
represented by w and w' are: 

1. ^-related, 

2. L-related, 

3. "K-related. 

Proof of 5.4. Let S be the semigroup described by L. The elements w and w' 
are IR-related if and only if there exist s, t G S such that ws = w' and w't = w. 
That is, w Jl w ' if and only if either w = w or there exist s , t G S with ws = w ' 
and w't = w. The possibility that w = w' can be checked algorithmically by 
Theorem 4.1. The existence of an element s G S such that ws = w' is equivalent 
to the non-emptiness of the language 

{v G L:w# 1 v# 2 (w') rev G MJ. 

This context-free language can be effectively constructed and its non-emptiness 
effectively decided. Similarly, it is possible to decide whether there is an ele- 
ment t G S such that w't = w. Hence it is possible to decide whether wOlw'. 

Similarly, one can effectively decide whether wLw'. Since w"Kw' if and 
only if wOlw' and w£ w', whether w and w' are JC-related is effectively de- 
cidable. [54] 

Corollary 5.5. There is an algorithm that takes as input an interpreted word-hy- 
perbolic structure and decides whether the semigroup it describes is a group. 

Proof of 5.5. Suppose the input word-hyperbolic structure is I and that it de- 
scribes a semigroup S. Apply Algorithm 5.1. If S is not a monoid, it cannot 
be a group. Otherwise we know that S is a monoid and we have a word 
i G L(Z) that represents its identity. For each a G A(L), check whether aOli 
and a Li: if all these checks succeed, then every generator is both right-and 
left-invertible, and so S is a group; if any fail, there is some generator that is 
either not right- or not left-invertible and so S cannot be a group. Hence it is 
decidable whether I describes a group. [5^5] 

Question 5.6. Are Green's relations T) and S decidable for word-hyperbolic 
semigroups? 

Note that D and 3 are both undecidable for automatic semigroups [Ott07, 
Theorems 4.1 & 4.3]. 
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6 BEING COMPLETELY SIMPLE 

This section shows that it is decidable whether a word-hyperbol- 
ic semigroup is completely simple. This is particularly useful because a com- 
pletely simple semigroup is word-hyperbolic if and only if its Cayley graph is 
a hyperbolic metric space [FK04, Theorem 4.1], generalizing the equivalence 
for groups of these properties for groups. 

Definition 6.1. Let S be a semigroup, I and A be index sets, and P be a A x I 
matrix over S whose (A, i)-th element is pA,i- The Rees matrix semigroup 
M[S; I, A; P] is defined to be the set I x S x A with multiplication 

(i, g,A)(j,h, \i) = (i,gp A jh, 

Recall that a semigroup is completely simple if it has no proper two-sided 
ideals, is not the two-element null semigroup, and contains a primitive idem- 
potent (that is, an idempotent e such that, for all idempotents f, we have 
ef = fe = f => e = f). The version of the celebrated Rees theorem due 
to Suschkewitsch [H0W95, Theorem 3.3.1] shows that all completely simple 
semigroups are isomorphic to a semigroup M[G;I,A;P], where G is a group 
and I and A are finite sets. 

Let A be an alphabet representing a generating set for a completely simple 
semigroup M[G; I, A; P]. Define maps v : A — > I and £, : A — > A by letting au 
and a£, be such that a G {au} x G x {aQ. For the purposes of this paper, we 
call the pair of maps (u, £,) the species of the completely simple semigroup. We 
first of all prove that it is decidable whether a word-hyperbolic semigroup is 
a completely simple semigroup of a particular species. 

Algorithm 6.2. 

Input: An interpreted word-hyperbolic structure L, two finite sets I and A, 
and two surjective maps v : A(Z) — > I and £, : A(Z) — > A. 

Output: If L describes a completely simple semigroup of species (u, £,), 
output Yes; otherwise output No. 

Method: At various points in the algorithm, checks are made. If any of 
these checks fail, the algorithm halts and outputs No. 

1. For each i 6 I and A € A, construct the regular language 

Li,A = {o-i • • • a n G L : at G A, a-| u = i, a n £, = A}. 

Check that each L^a is non-empty. 

2. For each i, j € I and A, |x € A, construct the context-free language 

{u#iv# 2 w rev G M:ueLi, A ,v G U,^ w G L — Lj, )M J, (6.1) 

and check that it is empty. 

3. For each i G I and A G A, choose a word w^a G L^a and construct the 
context-free language 

Ii, A = {u G L i>A : Wi,A#iU# 2 w^ G MJ. 

Check that each Lj^ is non-empty. 

4. For each i G I and A G A, choose a word u^a G I^a- 

5. For each a G A, i G I, and A G A, check that u^a = a and au^ = a. 
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6. For each a € A, i € I, and A, \± G A, calculate a word hi )a>li) A G L such that 

ht,a )M .,A = 

7. For each a G A, i. G I, and A, |i G A, check that hi, a>(X) ^ U| >a ^ = a. 

8. For each a € A, i £ I, and A, |x G A, check that 

9. For each a G A, i G I, and A, (x G A, construct the context-free language 

Vi, Q) ^A = {v G L : hi )Qj ^A#iv#2Ui ; A G M} 

and check that it is non-empty. 

10. For each a G A, i G I, and A, \i G A, choose some Vi )Q)l ^A 6 Vi )a ,n.,A and 
check that ah iiQ)HiA = u~X. 

11. Halt and output Yes. 

Lemmata 6.3 and 6.4 show that this algorithm works. 

Lemma 6.3. If Algorithm 6.2 outputs Yes, f/ie semigroup defined by the word-hyper- 
bolic structure I is a completely simple semigroup of species [v, £,). 

Proof of 6.3. Let S be the semigroup defined by the input word-hyperbolic 
structure I. Suppose the algorithm output Yes. Then all the checks in steps 
1-10 must succeed. 

For each i G I and A G A, let T^a = l-i,A- By the definition of L^a, for 
each a G A, the word a lies in L Qt) a A. By the check in step 1, each T^a is 
non-empty. 

By the check in step 2, for all i, j G I and A, \i G A, there do not exist 
u € L^a, v G Lj )M ., w G L — Li )(A with uv = w. That is, 

Ti,ATj^ C Ti,^ for all i, j G I and A, p. G A. (6.2) 

In particular, T^aT^a CTy and so each T^a is a subsemigroup of S. 

In each T^a, there is some element that stabilizes some other element w^a 
on the right (that is, that right-multiplies w^a like an identity) by the check in 
step 3. In step 4, u^a is chosen to be such an element. Let e^A = u^a". 

By the check in step 5, 

e aU) Aa = a and aei ;a ^ = a for all i G I and A G A. (6.3) 
In step 6, hi )ai( ^A is calculated for all i G I, A, U- G A, a G A so that 

hi, a ,mA = ei^ae i(A . (6.4) 
By (6.2), h iiQj ^ A G U,a- By the check in step 7, 

hi, a ,mAei,a£, = ei,^a for all i G I, A, (x G A, a G A. (6.5) 

Let i G I and A G A. Let t G T^a- Then t = aT 02 • • • IT^ for some G A. 
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Since ai az • • • a n G L^a, a-\v = i and a n £, = A. Then 



ai a 2 a 3 • • • a n 

= e i;A aT aj ■ ■ ■ a^e i)A [by (6.3), since ai u 

= ha,a 1 ,A,Aei, Ql f,a2 aj- • • a7Tei,A 

= ^M.,ai ,A,A hi ; a2,ai £,,A^i,a2f,l3 ' ' ' 1n£i,A 

= ^H,ai ,A,A n-i, ai , a i £,,A hi,a3,a2£,A6i,a3£, ■ ■ ■ Q-n.^-i,A 



— hi,ai ,A,A hi ) Q2 J ci 1 ^A h.| )a3)a2 ^ ; A ' ' ' Si,a n _i ^Cl^e^A 

[by repeated use of (6.5)] 

= Hi,ai,A v \hi > a2 ) a 1 £,Ah.i,a3,a 2 £,,A ■ ■ ■ ht, Qn , Qn _, £,,a [by (6.4)] 

Therefore the subsemigroup T^a is generated by the set of elements R^a = 
0H,a,n,A : a G A, \i G A}. 

By the check in step 8, for all i G I, A G A, and h G H^a, we have he^A = 
e^Ah = h. Since H^a generates T^a, it follows that e^x is an identity for T^a- 
So each T^a is a submonoid of S with identity e^A- In particular, each e^x is 
idempotent. 

Let i G I and A G A. By the check in step 9, every element h G H^a has a 
right inverse W in a- By the check in step 10, h/h = e^A and so W is also 
a left-inverse for h in Tt a- Thus every generator in a is both right- and 
left-invertible. Hence every element of T^a is both right- and left-invertible 
and so T^a is a subgroup of S. 

Since S is the union of the various T^a, the semigroup S is regular and 
the e^A are the only idempotents in S. Thus by (6.2), distinct idempotents 
cannot be related by the idempotent ordering. Hence all idempotents of S are 
primitive. Since S does not contain a zero (since it is the union of the Tj. a and 
(6.2) holds), it is completely simple by [H0W95, Theorem 3.3.3]. [ilj 

Lemma 6.4. If semigroup defined by the word-hyperbolic structure L is a completely 
simple semigroup of species [v, E,), then Algorithm 6.2 outputs Yes. 

Proof of 6.4. Suppose the semigroup S defined by the word-hyperbolic struc- 
ture L is a completely simple semigroup, with S = M[G; I, A; P]. For all i G I 
and A G A, let e^x be the identity of the subgroup T^a = {i} x G x {A}; that is, 
e i,A = (t-)Pa I? A). For each a G A, the element a has the form (au, g a , a£J for 
some g a G G. 

By the definition of multiplication in S, the word a-\ ■ ■ ■ a n G L represents 
an element of T^a if and only if aiv = i and a n £, = A. Hence each L^a must 
be the preimage of T^a and map surjectively onto T^a- In particular, L^a must 
be non-empty and so the checks in step 1 succeed. 

For any i, j G I and A, \x G A, we have T^aT^ C Tt )M .. Hence if u G Li )A , 
v G Lj )Vl , and w G L are such that uv = w, then w G L| ;|a . Thus the language 
(6.1) is empty for all i, j G I and A, \x G A. Hence all the checks in step 2 
succeed. 

For any i G I and A G A, if w^a G L^a, then w^a G T^a- Since T^a 
is a subgroup, Wi^i,x = w x,x, and e^A is the unique element of T^a that 
stabilizes w^x on the right. Thus the language Ii ; a is non-empty, and consists 
of words representing e^A- Hence the checks in step 3 succeed, and the words 
ui a chosen in step 4 are such that u^a = £\,x- 



= i and a n £, = A] 
[by (6.5)] 
[by (6.5)] 
[by (6.5)] 
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In a completely simple semigroup, each idempotent is a left identity within 
its own IR-class and ei )Q £, is a right identity within its own £-class [H0W95, 
Proposition 2.3.3]. Hence for each a 6 A, i G I, and A € A, we have 6 QU ^ci — a. 
and ae^af, = a. Thus the checks in step 5 succeed. 

For all a € A, i G I, and A, (x € A, 



h-i,a,(i,A T-H,a£, 
= 6i,|i.Q-£i,A£i,a£, 

= et lM .(au, g Q , a£,)(i,p^ 1 i , X)[i,Val,v a ^ 

= ei^(au,g a ,af,) 
= e^a. 

Thus all the checks in step 7 succeed. 

For all a € A, i € I, and A, |x G A, the element hi Q Vi) x lies in the subgroup 
Ti,A/ whose identity is e^A- Hence all the checks in step 8 succeed. Since all 
elements of this subgroup are right-invertible, each language Vi, a ,v.,\ is non- 
empty; hence all the checks in step 9 succeed. Finally since a right inverse is 
also a left inverse in a group, all the checks in step 10 succeed. Therefore the 



algorithm reaches step 10 and halts with output Yes. [64 



Theorem 6.5. There is an algorithm that takes as input an interpreted word-hy- 
perbolic structure L for a semigroup and decides whether it is a completely simple 
semigroup. 

Proof of 6.5. We prove that this problem can be reduced to the problem of 
deciding whether the semigroup defined by an interpreted word-hyperbol- 
ic structure I is a completely simple semigroup of a particular species [v : 
A(I) -» I, £, : A(I) -» A). 

Let S be the semigroup specified by L. Then S is finitely generated. Thus 
we need only consider the problem of deciding whether S is a finitely gen- 
erated completely simple semigroup. By the definition of multiplication in a 
completely simple semigroup (viewed as a Rees matrix semigroup), the left- 
most generator in a product determines its 3?-class (that is, the I-component 
of the product) and the rightmost generator in a product determines its L- 
class (that is, the A-component of the product). Thus there must be at least 
one generator in each 31- and £- class, and hence if S is an I x A Rees matrix 
semigroup, both |I| and jA| cannot exceed |A(1)|. 

Thus it is suffices to decide whether S is an I x A completely simple semi- 
group for some fixed choice of I and A, for one can simply test the finitely 
many possibilities for index sets I and A no larger than A(Z). 

One can restrict further, and ask whether S is completely semigroup of 
some particular species [v : A(I) — > I, £, : A(I) — )■ A), for there are a bounded 
number of possibilities for the maps surjective v and £,, so it suffices to test 
each one. 



7 BEING A CLIFFORD SEMIGROUP 

This section is dedicated to showing that being a Clifford semi- 
group is decidable for word-hyperbolic semigroups. Recall the definition of a 
Clifford semigroup: 
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Definition 7.1. Let Y be a [meet] semilattice and let {G a : a G Y} be a collec- 
tion of disjoint groups with, for all a, |3 G Y such that a ^ |3, a homomorphism 
4>a,p : G a — ^Gp satisfying the following conditions: 

1. For each a G Y, the homomorphism 4> a>a is trivial. 

2. For a, (3,y G Y with a ^ |3 ^ y, 

The set of elements of the Clifford semigroup S[Y; G a ; cfj^p] is the union of the 
disjoint groups G a . The product of the elements s and t of S, where s G G a 
and t G Gp, is 

(s4) a , aA p)(t4)|3, a Ap), (7- 2 ) 
which lies in the group G a Ap- [The meet of a and (3 is denoted a A |3.] 

Let A be an alphabet representing a generating set for a Clifford semigroup 
S[Y; G a ; 4) a)( 3]. Define a map £, : A — > Yby letting a£,be such that a G G Q ^. For 
the purposes of this paper, we call this map £, : A — >■ Y the species of the Clifford 
semigroup. [Notice that the map £, extends to a unique homomorphism £, : 
A + — v Y.] We first of all prove that it is decidable whether a word-hyperbolic 
semigroup is a Clifford semigroup of a particular species. 

Algorithm 7.2. 

Input: An interpreted word-hyperbolic structure L and a map f, : A — > Y. 

Output: If I describes a Clifford semigroup of species £, : A — >■ Y, output 
Yes; otherwise output No. 

Method: At various points in the algorithm, checks are made. If any of 
these checks fail, the algorithm halts and outputs No. 

1. For each a G Y, construct the regular language 

L K = {w £ L : wf, = a}. 

(These languages are regular since L is regular, Y is finite, and the map 
£, : A — > Y is known.) Check that each L a is non-empty. 

2. For each a, (3 G Y, construct the context-free language 

{u#,v# 2 w rev G M:uG L a ,v G L p ,w G L-L aA(3 } (7.3) 

and check that it is empty. 

3. For each a G Y, choose some word w a G L a and construct the context-free 
language 

l a = {ieL a :w a #,V a eT eM} 
and check that I a is non-empty. 

4. For each a G Y, pick some i a G I a and check that for all a, (3 G Y, i a ip = 

T-aA(3- 

5. For each a G A, check that i a £, a = ai a f, = a. For each a G Y and a G A 
check that ai a = i a a. 

6. For each a G Y and a G A such that a£, ^ a, construct the context-free 
language 

Vcc,q = {v G L a : a#!V#2i a € M} 
and check that V a) a is non-empty. 
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7. For each a G Y and a G A such that a£, ^ a, pick some v a>a G V a)a and 
check that v ajQ a = i a . 

8. Halt and output Yes. 

Lemmata 7.3 and 7.4 show that this algorithm works. 

Lemma 7.3. If Algorithm y.2 outputs Yes, f/ze semigroup described by the word-hy- 
perbolic structure L is a Clifford semigroup of species L, : A — > Y. 

Proof of 7. 3. Let S be the semigroup defined by the input word-hyperbolic 
structure L. Suppose the algorithm output Yes. Then all the checks in steps 
1-7 must succeed. 

For each a G Y, let T a = L a . By the check in step 1, all T a are non-empty. 

By the check in step 2, for every a, (3 G Y, there do not exist uGL a , vGLp, 
w G L - L a A|3 with uv = w. That is, T a T p C T aA(3 . In particular, T a T a C T„ 
and so each T a is a subsemigroup of S. 

In each T a , there is some element that right-multiplies some other element 
like an identity by the check in step 3. 

For each a G Y, the word i a represents an element e a , and the set of 
elements E = [e a : a G Y} forms a subsemigroup isomorphic to the semilattice 
Y by the check in step 4. 

By the checks in step 5, for each a G A, the element e Q ^ (which, like a, 
lies in T a ^)) acts like an identity on a (that is, e a ^a = ae a E, = a)/ and every 
element e a commutes with a. 

Let a G Y and t G T a . Then t = aTa2---a7L for some at G A with 
(ai a.2 • • • a n )£, = a. Then 

[by the check in step 6] 
[by the check in step 6] 
[by the isomorphism of E and Y] 
[by the extension of E, to A + ] 

Thus t = e a t. Similarly te a = t. Hence e a is an identity for T a . 

For each a G Y and a G A with a£, ^ a, there is an element v aiQ G T a such 
that v a>a a = av^a = e a by the checks in steps 6 and 7. Since J a is generated 
by elements a such that a£, ^ a, it follows that T a is a subgroup of S. 

Since L is the union of the various L a , the semigroup S is the union of 
the various subgroups T a . In particular, S is regular. Furthermore, the only 
idempotents in S are the identities of these subgroups; that is, the elements 
e a . Since every e a commutes with every element of A, it follows that all 
idempotents of S are central. Hence S is a regular semigroup in which the 
idempotents are central, and thus is a Clifford semigroup by [H0W95, Theo- 
rem 4.2.1]. [7^1 

Lemma 7.4. If the semigroup defined by the word-hyperbolic structure L is a Clifford 
semigroup of species £, : A — > Y, then Algorithm 7.2 outputs Yes. 

Proof of 7. 4. Suppose the semigroup S defined by the word-hyperbolic struc- 
ture (A, L, M(L)) is a Clifford semigroup, with S = S[Y; G a ; p]. For each 
a G Y, let e a be the identity of G a . The language L a clearly consists of exactly 



ai 0.2 ■ ■ ■ a n 

= e Ql £,a7e a2f ,a2"- • • e ant a^ 

= e Ql £,e Q2 £, • • • e Qn £,aTa2~- • • 

= e( Ql £,)A(a 2 f,)A---A(a n £,)aTa2 - • •Q^ 

= e( Ql a 2 - a n )£,aaTa2 • • • aTT 

= e a a7a2- • • a^. 
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those words in L that map onto G a , so L a is non-empty. Hence the checks in 
step 1 succeed. 

By the definition of multiplication in a Clifford semigroup, G a Gp C G a a p • 
Hence if u G L a , v G Lp, and w G L are such that uv = w, then w G L a/ \p- 
Thus the language (7.3) is empty for all a, |3 G Y. Hence all the checks in step 2 
succeed. 

Let a G Y. For any w a G L a , the element w a lies in the subgroup G a . Thus 
the language I a consists of precisely the words that represent elements of G a 
that right-multiply w a like an identity. Since G a is a subgroup, every element 
of l(x represents e a . Since there must be at least one such representative, I a is 
non-empty. Thus every check in step 3 succeeds. 

The identities e a form a subsemigroup isomorphic to the semilattice Y by 
the definition of multiplication in a Clifford semigroup. Thus every check in 
step 4 succeeds. 

Furthermore, every e a is idempotent and thus central in S by [H0W95, 
Theorem 4.2.1], and so every check in step 5 succeeds. 

Let a G Y and a G A be such that a£, ^ a. Let v a;a be the word represent- 
ing (ac^a^ar 1 • Then 

u^v^" = a(a4) Q £, ia ;r 1 = (acba^Hacba^ar 1 = e a . 

Hence 

^oc,a ^ ^a,a <md so all the checks in step 6 succeed. Similarly v a)a u Q 
and so all the checks in step 7 succeed. 

Therefore the algorithm reaches step 8 and halts with output Yes. [74] 

Theorem 7.5. There is an algorithm that takes as input an interpreted word-hyper- 
bolic structure I for a semigroup and decides whether it is a Clifford semigroup. 

Proof of 7. 5. We prove that this problem can be reduced to the problem of 
deciding whether the semigroup defined by an interpreted word-hyperbolic 
structure I is a Clifford semigroup with a particular species £, : A(L) — > Y. 

Let S be the semigroup specified by I. Then S is finitely generated. Thus 
we need only consider the problem of deciding whether S is a finitely gener- 
ated Clifford semigroup, whose corresponding semilattice must therefore also 
be finitely generated. A finitely generated semilattice is finite. 

So if S is a Clifford semigroup S[Y; G a ; 4> a) p], the semilattice Y must be a 
homomorphic image of the free semilattice of rank |A(Z)|, which has 2^ A ^ L ^ — 1 
elements. Thus it is suffices to decide whether S is a Clifford semigroup for 
some fixed semilattice Y, for one can simply test the finitely many possibilities 
forY. 

One can restrict further, and ask whether S is a Clifford semigroup with 
some fixed semilattice Y and some particular placement of generators into the 
semilattice of groups. (That is, with knowledge of in which group G a each 
generator a putatively lies, described by a map £, : A(L) — > Y. Of course, it is 
necessary that im £, generates Y.) There are a bounded number of possibilities 
for the map so it suffices to test each one. [7^5] 



8 BEING FREE 

This section shows that it is decidable whether a word-hyper- 
bolic semigroup is free. The following technical lemma, which is possibly of 
independent interest, is necessary. 
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Lemma 8.1. There is an algorithm that takes as input an alphabet A, a symbol #2 not 
in A, and a context-free grammar V defining a context-free language L(T) that is a 
subset o/A*#2A*, and decides whether L(T) contains a word x#2W rev where x/w. 

Proof of 8.1. Suppose V = (N, AU{#2}, P, O). [Here, N is the set of non-terminal 
symbols, A U {#2} is of course the set of terminal symbols, P the set of produc- 
tions, and O G N is the start symbol.] Since M does not contain the empty 
word (since every word in M lies in A*#2A*), assume without loss that V 
contains no useless symbols or unit productions [HU79, Theorem 4.4]. 
Let 

N# = {M G N : (3u,v G A*)(M p# 2 q)}. 

Notice that if M — > p is a production in P and M G N — N#, then every 
non-terminal symbol appearing in p also lies in N — N#. [This relies on there 
being no useless symbols in V, which means that every other non-terminal in 
P derives some terminal word.] For this reason, it is easy to compute N#. 

Suppose that M =>* uMv for some M G N — N# and u,v G (A U {#2})*- 
Then u and v cannot contain #2 since M G N — N#. Since there are no unit 
productions in P, at least one of u and v is not the empty word. Since M is 
not a useless symbol, it appears in some derivation of a word w#2X rev G L(F). 
Pumping the derivation M =>* uMv yields a word w'#2(x') rev where exactly 
one of w' = w or x' = x holds, since the extra inserted u and v cannot be on 
opposite sides of the symbol #2 since M G N — N#. Hence either w / xor 
w' 7^ x'. Hence in this case L(T) does contain a word of the given form. 

Since it is easy to check whether there is a non-terminal M G N — N# with 
M uMv, we can assume that no such non-terminal exists. Therefore any 
non-terminal M G N — N# derives only finitely many words (since any deriva- 
tion starting at M can only involve non-terminals in N — N# and by assumption 
no such non-terminal can appear twice in a given derivation). These words 
can be effectively enumerated. Let M G N — N# and let wi , . . . ,w n be all the 
words that M derives. Replacing a production S — > pMq by the productions 
S — > pwi q, S — s- pw2q, S — > pw n q does not alter L(T). Iterating this 
process, we eventually obtain a grammar V where no non-terminal symbol in 
N — N# appears on the right-hand side of a production. Thus all symbols in 
N — N# can be eliminated and we now have a grammar V with N = N#. 

Every production is now of the form M — > pS q or M — > p#2q, where 
p, q G A* and S G N. [There can be only one non-terminal on the right-hand 
side of each production, since otherwise some terminal word would contain 
two symbols #2, which is impossible.] 

We are now going to iteratively define a map 4> : N — > FG(A), where FG(A) 
denotes the free group on A, which we will identify with the set of reduced 
words on AuA - . First, define Oct = e. Now, iterate through the productions 
as follows. Choose some production M — > pSq rev such that Mcf) is already 
defined. Let z = p _1 (MiJ))q G FG(A). If S4> is undefined, set Sty = z. If Sfy is 
defined, check that S 4> and z are equal; if they are not, halt: L(T) does contain 
words w#2X rev with w 7^ x. 

To see this, suppose Scf> = z and consider the sequence of productions that 
gave us the original value of Scj): 

O -»• U! Si vf v , Si -> u 2 S 2 v 2 ev , ■ • • , Sic u k Sv r k ev , 

which implies that Scj) = (ui U2 • • • uij -1 vi V2 • • -v^, and the sequence that 
gave us Mcf): 

O pi Mi q r f v , Mi -> p 2 M 2 q r 2 ev , . . . , M k Pl Mqf v , 
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which implies that M<J) = (pip 2 • • • pi) qi q 2 • • • qi. Choose r, s G A* such 
that S ^* r# 2 s rev . Then L(T) contains both both U-t • ■ ■ u k r# 2 s rev v r k ev ■ ■ ■ v i ev 
and (recalling that M — > pSq rev is a production) pi • • • pipr# 2 s rev q rev q I [ ev • • • qi . 
Suppose ui • • -uicT = vi • • -VkS and pi • • -pipr = qi • • • qiqs. Then Scf) = 
(ut •••u k ) _1 v 1 •••v k = rs" 1 = (pi •••pip) _1 qi • • • qiq = p _1 (M4))q = z, 
which is a contradiction. 

Once we have iterated through all the productions of the form M — > pSq rev , 
iterate through the productions of the form M — > p# 2 q rev , and check that 
p _1 (M.(J))q. If this check fails, halt: L(T) does contain words w# 2 x rev with 
w ^ x; the proof of this is very similar to the previous paragraph. 

Finally, notice that if the iteration through all the productions completes 
with all the checks succeeding, a simple induction on derivations, using the 
values of M<J), shows that all words w# 2 x rev G L(F) are such that w = x. |8.i| 

Algorithm 8.2. 

Input: An interpreted word-hyperbolic structure L. 

Output: If L describes a free semigroup, output Yes; otherwise output No. 
Method: 

1. For each a € A, iterate the following: 

(a) Construct the context-free language 

D a = {uv : u#!V# 2 a rev G MJ. 

(b) Check whethe D a is empty. If it is empty, proceed to the next interation. 
If it is non-empty, choose some word d a G D Q . If d Q contains the letter 
a, halt and output No. If d a does not contain the letter a, define the 
rational relations 

Ql = ({(a,d Q )}U{(b,b) :b G A-{a}}) + 
Qm = ({(a,d a )}U{(b,b) :b G A-{a}}) + #! 

({(a,d Q )}U{(b,b):bGA-{a}}) + # 2 

({(a,d r a ev )}U{(b,b):bGA-{a}}) + . 

Modify I as follows: replace A by A — {a}; replace L by L o Q L ; and 
replace M by M o Q M , and proceed to the next iteration. 

2. If L / A+, halt and output No. 

3. Define the rational relation 

> = {(o,a):a€A}U{(#i,E),(# 2 ,#2)}. 

Let N = MoT. Using the method of Lemma 8.1, check whether N contains 
any word of the form x# 2 w rev with x / w. If so, halt and ouput No. 
Otherwise, halt and output Yes. 

Lemmata 8.3 to 8.5 show that this algorithm works. 

Lemma 8.3. If Lis a word-hyperbolic structure for a semigroup S, then the replace- 
ment I produced in step i(b) is also a word-hyperbolic structure for a semigroup 
S. 



22 



Proof of 8.3. If the language D a is non-empty, then any word w G D Q is such 
that W = a. In particular, d a = a. Furthermore, since d Q G (A — {a})*, we see 
that d is a redundant generator. The rational relation Qj_ relates any word in 
A + to the corresponding word in (A — {a}) + with all instances of the symbol 
a replaced by the word d a . The rational relation Qm relates any word in 
A+#! A+# 2 A+ to the corresponding word in (A-{a})+#i (A-{a}) + # 2 (A-{a})+ 
with all instances of the symbol a before #2 replaced by the word d Q and all 
instances of the symbol a after #2 replaced by the word d^ v . Hence 

MoQ M C (LoQ L )# 1 (LoQ L )# 2 (LoQ L ) rev . 

Since application of rational relations preserves regularity and context-freedom, 
LoQ L is regular and M o Q M is context-free. Finally, since a = d Q , we see that 
L o Q L maps onto S, and similarly MoQ M describes the multiplication of 
elements of S in terms of representatives in L. [sTJ 

Lemma 8.4. If Algorithm 8.2 outputs Yes, the semigroup defined by the word-hyper- 
bolic structure I is a free semigroup. 

Proof of 8.4. The algorithm can only halt with output Yes in step 3, so the algo- 
rithm must pass step 2 as well. Hence the language of representatives is A + . 
Let S be the semigroup defined by L and let 4> : A + — > S be an interpretation. 

Suppose for reductio ad absurdum that 4> is not injective. Then there are dis- 
tinct words u, v G A* such that ucf) = v$. Since 4>|a is injective by definition, at 
least one of u and v has length 2 or more. Interchanging u and v if necessary, 
assume |u| ^2. So u = u'u", where u' and u" are both non-empty. Since 
L = A+, we have u',u" G L and so u'# 1 u"# 2 v rev G M. Hence u# 2 v rev G N. 
But since the algorithm outputs Yes at step 3, there is no word x#2W rev G M 
with x ^ w. This is a contradiction and so is injective. 

So c|) : A + — > S is an isomorphism and so S is free. [84 

Lemma 8.5. If the word-hyperbolic structure L defines a free semigroup, Algorithm 
8.2 outputs Yes. 

Proof of 8.5. Let B + be the semigroup defined by I. Let : A + — > B + be an 
interpretation. Since elements of B are indecomposable, B C Acf). 

In step 1, the algorithm iterates through each a G A. For each a G Bcb 1 C 
A, since afy is indecomposable, the language D a is empty and the algorithm 
moves to the next iteration. 

Let a G A — B4? 1 . Then acf) has length (in B + ) at least two and so is 
decomposable. Hence there exist u,v G L such that uv G D Q . Furthermore, 
since ucf) and v<J) must be shorter (in B + ) than cuf), neither u nor v can include 
the letter a. Hence the replacement of L described in step i(b) takes place. 
Since this occurs for all a G A — Bcb 1 , at the end of step 1 we have a word- 
hyperbolic structure I with A = Bcb 1 . Since 4>|a is injective, 4>|a must be a 
bijection from A to B. Hence the homomorphism : A + — > B + must be an 
isomorphism, and so L = A + ; thus the check in step 2 is successful. Therefore 

M = {u#!V# 2 (uv) rev : u, v G A+} 

and so 

M o IP = {w# 2 w rev : w G A+}. 

Thus the check in step 3 is successful and the algorithm terminates with out- 
put Yes. [^5" 
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Thus, from Lemmata 8.4 and 8.5, we obtain the decidability of freedom for 
word-hyperbolic semigroups: 

Theorem 8.6. There is an algorithm that takes as input an interpreted word-hyper- 
bolic structure Lfor a semigroup and decides whether it is a free semigroup. 



9 OPEN PROBLEMS 

This conclusing section lists some important question regarding 
decision problems for word-hyperbolic semigroups. 

Question 9.1. Is there an algorithm that takes as input an interpreted word- 
hyperbolic structure for a semigroup and decides whether that semigroup is 
(a) regular, (b) inverse? 

Whether these properties are decidable for automatic semigroups is cur- 
rently unknown. 

Question 9.2. Is there an algorithm that takes as input an interpreted word- 
hyperbolic structure for a semigroup and decides whether that semigroup is 
left-/ right-/ two-sided-cancellative? 

Cancellativity and left-cancellativity are undecidable for automatic semi- 
groups [Caio6]. Right-cancellativity is, however, decidable [KO06, Corollary 3.3] 

Question 9.3. Is there an algorithm that takes as input an interpreted word- 
hyperbolic structure for a semigroup and decides whether that semigroup is 
finite? 

The equivalent question for automatic semigroups is easy: one takes an 
automatic structure, effectively computes an automatic structure with unique- 
ness, and checks whether its regular language of representatives is finite. 
However, this approach cannot be used for word-hyperbolic semigroups, be- 
cause there exist word-hypebolic semigroups that do not admit word-hyper- 
bolic structures with uniqueness indeed, they may not even admit regular 
languages of unique normal forms [CM12, Examples 10 & 11]. 

Question 9.4. Is there an algorithm that takes as input an interpreted word- 
hyperbolic structure for a semigroup and decides whether that semigroup 
admits a word-hyperbolic structure with uniqueness? (That is, where the 
language of representatives maps bijectively onto the semigroup.) If so, it is 
possible to compute a word-hyperbolic structure with uniqueness in this case? 
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